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1. Abstract 

Benford's law states that the occurrence of significant digits in many data sets is not 
uniform but tends to follow a logarithmic distribution such that the smaller digits 
appear as first significant digits more frequently than the larger ones. We investi- 
gate here numerical data on the country-wise adherent distribution of seven major 
world religions i.e. Christianity, Islam, Buddhism, Hinduism, Sikhism, Judaism and 
Baha'ism to see if the proportion of the leading digits occurring in the distribution 
conforms to Benford's law. We find that the adherent data of all the religions, ex- 
cept Christianity, excellently does conform to Benford's law. Furthermore, unlike 
the adherent data on Christianity, the significant digit distribution of the three ma- 
jor Christian denominations i.e. Catholicism, Protestantism and Orthodoxy obeys 
the law. Thus in spite of their complexity general laws can be established for the 
evolution of the religious groups. 

2. Keywords 

Benford's law; religion; adherents 

3. Introduction 

In our daily life, we come across huge sets of numerical data being generated due to 
different activities both human as well as natural. What could be common between 
two seemingly unrelated data sets like the size of files stored on a computer and the 
geographical area of the countries? An intuitive reply would be nothing. However, 
the occurrence of significant digits in many data sets is governed by a common 
logarithmic distribution which is defined by the equation called Benford's law [1]. 
The first significant digit of a number is the first non-zero digit on its extreme left 
like 5 for 521 and 1 for 0.0184 respectively. The establishment of Benford's law was 
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based on the curious observation made by Simon Newcomb who observed and said 
that the initial pages of the logarithmic tables were more worn out than the later 
ones due to the fact that people look more often for numbers with smaller first non- 
zero digits [2] . The observation was popularized by Frank Benford (hence the name 
Benford's law) who then analyzed a large number of unrelated data sets collected 
from diverse fields e.g. the physical constants, the atomic and molecular masses, the 
areas of countries, the length of rivers etc., and concluded that the occurrence of 
first significant digits in all his data sets does follow a logarithmic distribution and 
hence established the law in the form of following mathematical equation [1] . 

P(rf)=Zo5io(l + J),d=l,2,3...,9 (1) 

where P(d) is the probability of a number having the first non-zero digit d. 
Benford's law is also known as "first digit law" or "law of the leading digits" . Ac- 
cording to equation (1), in a given data set the probability of occurrence of a certain 
digit as first significant digit decreases logarithmically as the value of the digit in- 
creases from 1 to 9. Thus digit 1 should appear as the first significant digit about 
30% times, digit 2 about 17% times and similarly 9 should appear about 4% times. 
This observation is in stark contrast to the equal probability of about ^ =11% for 
each digit from 1 to 9 if all are equally likely to occur as the first significant digit. 
Despite many of its properties like base and scale invariance being unraveled [3,4] 
over more than a century after its discovery a complete explanation of Benford's law 
has remained elusive [5]. However, increasing number of data sets from unrelated 
phenomena have been tested to conform to this law with high statistical accuracy. 
After remaining obscure for decades, the law is finding wide spread applications. In 
economics, it has been validated for stock market prices [6] and applied in detect- 
ing the fraudulent data submitted by the companies [7] , verifying the reliability of 
macroeconomic data of the countries [8] and in the determination of winning bids 
in e-Bay auctions [9]. In physics the law has been successfully applied to numerical 
data on physical constants [10], atomic spectra [11], decay width of hadrons [12], 
magnitude and depth of earthquakes [6,13] and mantissa distribution of pulsars [14]. 
The law has been utilized in biological sciences to check the veracity of the data on 
clinical trials and discovery of drugs [15, 16], study of diseases and genes [17]. The 
law is used in optimizing the size of computer files [18], enhancing the computing 
speed [19] and appearance of numbers on the internet [20]. On the social sciences 
front the law has been used to detect election frauds and anomalies [21]. 
We show here the applicability of Benford's law to the field of religious demography. 
Though very important, the research on religious demography is still in infancy and 
admittedly inconsistencies in world religion adherent data cannot be ruled out com- 
pletely. Besides reshaping the social landscape, the change in religious demographics 
is also affecting the political decisions of various countries. To prevent problems like 
making a flawed policy decision due to the use of some erroneous data it is the 
obligation of the researchers to reassure the policy makers about the quality of 
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the data. We analyze the available data on the country-wise adherent distribution 
of seven major world religions Christianity, Islam, Buddhism, Hinduism, Sikhism, 

Judaism and Baha'ism. We find that country-wise adherent data on all religions 
except Christianity follow Benford's law. We also reveal the validity of Benford's 
law on Catholicism, Protestantism and Orthodoxy, the three major denominations 
of Christianity. 

4. Data 

Due to the developmental stage of the religious demography research, obtaining 
complete data on adherents of a particular faith and its distribution from a single 
source is not quite easy. The Wikipedia combines the available data from different 
sources like the CIA World Factbook, US State Department's International Reli- 
gious Freedom Report, Pew Research Center, Adherents.com, Joshua Project etc. 
to arrive at comprehensive data tables for the country-wise adherent distribution of 
major world religions. The data on country-wise number of Muslims are taken from 
Pew Research Center study on the size and distribution of world's Muslim popula- 
tion [22,23]. The number of adherents of Baha'i faith in each country is based on 
the 2005 estimates of World Christian Database as reported in The Association of 
ReUgion Data Archive (ARDA) [24]. 

4.1. Data analysis and Results 

The detailed statistical analysis of the country-wise adherent distribution is shown 
in Tables 1 and 2. The Nobs^ the number of counts appearing in the corresponding 
data set as first significant digit, against each digit from 1 to 9 are shown for 
each religion in columns 2, 3, 4, 5, 6 of Tables 1 and 2. We also show A^Ben, the 
corresponding counts (in brackets) for each digit as predicted by Benford's law: 

NBen = Nlogwil + ^) (2) 

along with the root mean square error (A7V) calculated from the binomial distribu- 
tion [12] 

AN = y'NP{d){l-P{d)) (3) 

where N for each religion is the total number of countries its adherents are reported 
to exist. For example, as shown in column 2 of Table 1, the adherents of Christianity 
are reported in 205 territories [25] . The observed count for digit 1 as first significant 
digit is 50 whereas the expected count from Benford's law is 61.7 with an error of 
about 6.6. To evaluate the degree of agreement between the observed and expected 
first digit distributions let us first write the Null Hypothesis, Hq that the observed 
distribution of the first significant digit in each of the case we consider is same as 
expected on the basis of Benford's law. To test the null hypothesis we carry out the 
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Pearson's -y^ test. 



n 



{Nobs - NBen? 



Nb. 



(4) 



In our case n = 9 which means we have n — 1 = 8 degrees of freedom. Under 

95% confidence level (CL) x^(8)=15.507 which is the critical value for acceptance or 
rejection of null hypothesis. If the value of the calculated is less than the critical 
value then we accept the null hypothesis and conclude that the data fits Benford's 
law. 

Christianity has the largest number of adherents spread across 205 countries (col- 
umn 2 of Table 1). The x^= 16.419 (the last row and column 2 of Table 1) is larger 
than x^(8) = 15.507 and thus forcing us to reject the null hypothesis indicating in 
turn that the data on the country-wise adherent distribution of Christianity does 
not follow Benford's law. We further checked the behavior of three largest Christian 
denominations, Catholicism (197 countries) [26], Protestantism (171 countries) [27] 
and Orthodoxy (42 countries) [28] as shown in Columns 3, 4 and 5. Surprisingly their 
respective values 10.143, 5.208, 6.946 turn out to be lower than x^(8) = 15.507 
and hence null hypothesis must be accepted. In column 6 of Table 1 we show the 
counts of significant digits of Muslim populations from 184 countries as reported in 
a comprehensive demographic study of Muslims conducted by Pew Research Cen- 
ter [22,23]. The Pearson's = 10.646 is well below the critical value of 15.507 and 
null hypothesis is accepted. In Table 2 we show the data on Baha'ism (175 coun- 
tries) [24], Buddhism (129 countries) [29], Hinduism (97 countries) [30], Sikhism (47 
countries) [31] and Judaism (107 comitrios) [32]. The smaller values of the Pear- 
son's x^ make the null hypothesis acceptable and hence Benford's law describes 
the distribution of significant digits in each case. The Wikipedia table on the num- 
ber of adherents of Buddhism shows two values for some countries. Accordingly we 
consider both the situations taking into account the lower and the upper values 
separately. When smaller values are counted for the Buddhism (not shown here), 
the Pearson's x^ 6.633 turns out to be smaller than the critical value. The lowest 
value of x^ 1.786 occurs for Buddhism when only the larger values are considered for 
countries where two values are given. Further, although the sample size of the ad- 
herent data on Sikhism is quite small, reported only in 47 countries, the x^ = 6.863 
indicates that Benford's law upholds. Finally in Table 3 we sum up the number of 
counts for significant digits appearing in the adherent data sets on all the religions 
considered in our study. The Pearson's x^=''.310 being smaller than the critical 
value 15.507 shows that Benford's law describes the adherent data of major world 
religions with a reasonable accuracy. The results reported in the tables can readily 
be appreciated from Figs. 1-3. It can be seen that the occurrence of the signifi- 
cant digits in country- wise adherent distribution of religions closely resembles those 
predicted by Benford's law. 
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Table 1: The significant digit distribution of country-wise Christian, Catholic, 
Protestant, Orthodox and Muslim populations 



First Digit 


Christian 


Catholic 


Protestant 


Orthodox 


Muslim 




(205) 


(197) 


(171) 


(42) 


(184) 


1 


50 (61.7±6.6) 


47 (59.3±6.4) 


48 (51.5±6.0) 


12 (12.6±3.0) 


60 (55.4±6.2) 


2 


30 (36.1±5.4) 


34 (34.7±5.3) 


25 (30.1±5.0) 


5 (7.3±2.5) 


43 (32.4±5.2) 


3 


29 (25.6±4.7) 


31 (24.6±4.6) 


26 (21.4±4.3) 


7 (5.2±2.1) 


21 (23.0±4.5) 


4 


27 (19.9±4.2) 


18 (19.1±4.1) 


15 (16.6±3.9) 


3 (4.1±2.0) 


13 (17.8±4.0) 


5 


14 (16.2±3.9) 


19 (15.6±3.8) 


18 (13.5±3.5) 


4 (3.3±1.7) 


7 (14.6±3.7) 


6 


15 (13.7±3.6) 


10 (13.2±3.5) 


10 (11.4±3.3) 


4 (2.8±1.6) 


11 (12.3±3.4) 


7 


18 (11.8±3.3) 


10 (11.4±3.3) 


9 (9.9±3.0) 


5 (2.4±1.5) 


13 (10.7±3.2) 


8 


6 (10.5±3.1) 


15 (10.1±3.1) 


12 (8.7±2.9) 


2 (2.1±1.4) 


10 (9.4±3.0) 


9 


16 (9.4±3.0) 


13 (9.0±2.9) 


8 (7.8±2.7) 


(1.9itl.3) 


6 (8.4±2.8) 


Pearson 


16.419 


10.143 


5.208 


6.946 


10.646 



Table 2: The significant digit distribution of country-wise Baha'i, Buddhist, Hindu, 
Sikh and Jewish populations 



First Digit 


Baha'i 


Buddhist 


Hindu 


Sikh 


Jew 




(175) 


(129) 


(97) 


(47) 


(107) 


1 


61 (52.9±6.1) 


38 (38.8±5.2) 


26 (29.2±4.5) 


12 (14.1±3.1) 


42 (32.2±4.7) 


2 


35 (30.8±5.0) 


21 (22.7±4.3) 


16 (17.1±3.7) 


11 (8.3±2.6) 


13 (18.8±4.0) 


3 


20 (21.9dz4.4) 


15 (16.1±3.7) 


12 (12.1d=3.2) 


6 (5.9±2.3) 


15 (13.4±3.4) 


4 


11 (16.9±3.9) 


15 (12.5±3.4) 


10 (9.4±2.9) 


2 (4.5±2.0) 


6 (10.4±3.1) 


5 


10 (13.8±3.6) 


10 (10.2±3.1) 


8 (7.7±2.6) 


5 (3.7±1.8) 


12 (8.5±2.8) 


6 


16 (11.7±3.3) 


8 (8.6±2.8) 


11 (6.5±2.5) 


1 (3.1±1.7) 


7 (7.2±2.6) 


7 


9 (10.1±3.1) 


10 (7.5±2.6) 


3 (5.6±2.3) 


5 (2.7±1.6) 


6 (6.2±2.4) 


8 


5 (8.9±2.9) 


7 (6.6±2.5) 


3 (5.0±2.1) 


2 (2.4±1.5) 


3 (5.5±2.3) 


9 


8 (8.0±2.8) 


5 (6.0±2.4) 


8 (4.4±2.0) 


3 (2.1±1.4) 


3 (4.9±2.2) 


Pearson x"^ 


8.648 


1.786 


8.45 


6.863 


10.158 



Table 3: The significant digit distribution of country-wise adherents of all religions 



First Digit 


1 


2 


3 


4 


5 


6 


7 


8 


9 


Total 


Nobs 


289 


169 


118 


84 


66 


69 


64 


36 


49 


944 


NBeu 


284.2 


166.2 


117.9 


91.5 


74.7 


63.2 


54.7 


48.3 


43.2 




Error 


14.1 


11.7 


10.2 


9.1 


8.3 


7.7 


7.2 


6.8 


6.4 




Pearson x^ 




















7.766 



In the case of any major deviations from the predictions of Benford's law, the 
distributions as shown in figures could hardly be revealed. Particularly from Fig. 
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Fig. 1: Observed and Benford distributions of significant digits for country- wise 
adherents of Christianity, its three major denominations i.e. Catholicism, Protes- 
tantism and Orthodoxy, Islam and Baha'ism 
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Fig. 2: Observed and Benford distributions of significant digits for country-wise 
adherents of Buddhism, Hinduism, Judaism and Sikhism 



3 it is clear that agreement of the observed proportion of occurrence of significant 
digits for all the religions to those predicted by Benford's law is excellent. 



5. Discussion 

Despite being a key social attribute and the largest scale phenomenon involving 
people beyond the geographical boundaries of countries, the adherence to a reli- 
gion, has received relatively lesser scientific attention [33]. Furthermore, religion 
being a dominant force in society for both cohesion and conflict [34] , when it comes 
to the scientific study of religions the most debated question, having serious social 
and political consequences, is the estimation of size distribution of different reli- 
gious groups [23,35,36]. Studies have shown that the growth dynamics of religious 
activities are similar to those found in economic and scientific activities [37]. The 
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Fig. 3: Observed and Benford distributions of significant digits for country-wise 
adherents of all religions 

adherent distribution of major world religions has been shown to obey Zipf's and 
Pareto distributions and their evolution has been described using the same equa- 
tions as for the crystallization process [38,39]. A recent mathematical model has 
predicted the extinction of religion in societies which perceive the utility of not ad- 
hering to be greater than the utility of adhering [40]. 

Here we have shown that populations of major world religions follow Benford's law. 
It is worthwhile to mention here that F. Benford used population size of U.S. towns 
as input data to test and examine the accuracy of his observation [1]. Later on, the 
population of the 198 countries of world have been shown to follow the first digit 
law [41]. However, being undertaken by the individual governments, the census esti- 
mates of the total populations of countries are expected to be reliable. Same is not 
true for the estimates of adherents of different religions. Several religions compete 
with each other to increase the number of their adherents. Such followers can either 
be converted from one religion to another or become followers of a particular reli- 
gion though previously being associated with none [42]. This continuous evolution 
of religions puts serious restraints on determining the exact number of adherents of 
a particular religion, a task which is further complicated due to government favor 
to state religions in certain countries, government restriction on religious census or 
biased data from different religious associations for political propaganda [43] . Thus 
errors are expected to be found in the data on religious demographics and data 
must be subjected to scientic scrutiny for any alterations. 

Benford's law holds for those data sets in which the occurrence of numbers is free 
from any restrictions. It has been found that tampered, unrelated or fabricated 
numbers usually do not follow Benford's law [44]. Significant deviations from the 
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Benford distribution may indicate fraudulent or corrupted data [45] . In the light of 
these issues it would be interesting to check if the adherent data of major world 

rehgions submit to Benford's law and any departure from the law may alert the 
researcher about possible data misrepresentation. 

In our study we observed that the adherent data of three major Christian denomi- 
nations follow Benford's law. However, when Christianity is considered as a single 
religious group, the distribution of the significant digits of the adherent data devi- 
ates from the predictions of Benford's law. The deviation could possibly arise from 
the incorrect method used or erroneous grouping of data from different sources in 
Wikipedia. Further, the deviation from the Benford distribution may also arise from 
overestimation/underestimation of the number of Christian adherents in some coun- 
tries. The absence of a single religion database has forced many people including 
scholars to turn to Wikipedia for information on major world religions. But rele- 
vant Wikipedia data has been assumed to be devoid of sufficient academic rigor and 
consistency [23]. However, the prevalence of Benford's law in the adherent data on 
three largest Christian denominations, Buddhism, Hinduism, Sikhism, Judaism and 
Baha'ism infuses some confidence in the quality of the data available on Wikipedia 
and the method employed to collect and combine the data from different sources to 
arrive at these comprehensive adherent data tables. Now in the last decade, Muslim 
societies have generated considerable public interest and their population has come 
under increased scrutiny [22,23]. There are varying estimates of the size of world's 
Muslim population. We analyzed the estimates of the global Muslim population 
reported in the most recent study done by the Pew Research Center and found 
a reasonable agreement to Benford's law which in turn indicates the reliability of 
this demographic survey. The validity of Benford's law for adherent distribution 
of major world religions is a further hint that universal growth mechanisms might 
exist which can give rise to general laws independent of the particular details of the 
systems [37]. 



6. Conclusion 

The religion is the largest scale phenomenon in a society that has a bearing on 

social, economical and political behavior of its adherents. We applied, for the first 
time, Benford's law to the data on the adherent distribution of seven major world 
religions. We found that the adherent data of Islam, Hinduism, Buddhism, Sikhism, 
Judaism and Baha'ism follow Benford's law. Similarly, the adherent data of the 
three major Christian denominations Catholicism, Protestantism and Orthodoxy 
also follow the law but contrarily cumulative adherent data of Christianity when 
considered as a single religious group does not. Notwithstanding the complex path 
taken by a religious group to attain a certain state, its macroscopic behavior satisfies 
the simple law of leading digits. 
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